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a) 



It is well-known that a quantum measurement can enhance the transition 
probability between two quantum states. Such a measurement operates after 
preparation of the initial state and before postselecting for the final state. 
Here we analyze this kind of scenario in detail and determine which probabil- 
ity distributions on a finite number of outcomes can occur for an intermediate 
measurement with postselection, for given values of the following two quan- 
tities: (i) the transition probability without measurement, (ii) the transition 
probability with measurement. This is done for both the cases of projective 
measurements and of generalized measurements. Among other constraints, 
this quantifies a trade-off between high randomness in a projective measure- 
ment and high measurement-modified transition probability. An intermediate 
projective measurement can enhance a transition probability such that the 
failure probability decreases by a factor of up to 2, but not by more. 
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I. INTRODUCTION 



It is a puzzling property of quantum theory that a measurement on a physical 
system can change the state of that system in a drastic way. A well-known demon- 
stration of this can be made with polarizers (figured]): upon shining a beam of light 
onto two orthogonally aligned polarizers, no light at all passes through both of them. 
However after placing a third polarizer in between the two, such that this new one is 
not aligned with either of the other two, it is suddenly possible for some light to pass 
through the whole setup. Hence the middle polarizer, functioning as a projective 
measurement, has increased the transition probability from zero to a positive value! 

Using the simple geometry of a two-state quantum system e.g. in the Bloch 
sphere, it is not hard to see that the maximal measurement-modified transition 
probability in this polarizer scenario can be at most |. But what about other cases 
like c?-dimensional Hilbert spaces of states — what is the maximal modified transition 
probability then? Or what if the unmodified transition probability does not vanish? 
And how do the original and the modified transition probability relate to the statistics 
of the measurement? These questions are what we are concerned with here — mostly 
for the case of projective measurements, but also for generalized measurements. 

The answers to these questions are statements saying that certain things are pos- 
sible in quantum theory, while other things are not. Hence these answers might in 
principle be of interest for further high-precision experimental tests of the quantum 
formalism. For example, it seems conceivable that models with dynamical wavefunc- 
tion collapse make different predictions than orthodox quantum theory does. 

The natural framework for our considerations is the two-state vector formalism 
of Aharonov and Vaidma n AV07 . There, quantum theory becomes time-symmetric by 
considering two state vectors for a quantum system: an ordinary state vector 
evolving forward in time, and an additional state vector \<p) evolving backward in 
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FIG. 1. Enhancing transition probabilities by measurement. We regard the first polarizer 
as state preparation, the last polarizer as postselection, and the intermediate polarizer as 
a measurement. 



time. While \if)) is to be interpreted as an initial state, \<f>) should be thought of as 
a target final state: after all other measurements have been done, the experimenter 
measures the projection operator \4>)(<j)\ and records the results of the experiment 
in case of a positive result, while discarding the whole run in case of a negative 
result. This procedure is known as postselection with respect to \<f>). The polarizer 
example from above fits into this framework: the first polarizer can be regarded as 
preparation of the initial state while the final polarizer conducts a postselection 
with respect to a final state \<f>). The measurement statistics obtained by such a 
procedure are the statistics of the postselected ensemble. Postselected ensembles 
can show very counterintuitive behavior: examples are the phenomenon that the 
so-called "weak value" of an observable can be bigger than the observable's largest 
eigenvalu o AAV88 , or the three-boxes thought experiment which displays a high degree 
of contextualit y AV07 . Another unexpected property of postselected ensembles has 
then been found i n 1 ^"' 10 (see section [Til), and this is the line of investigation to be 
continued here. The present article should be readable without knowledge of any of 



the references mentioned. 

Synopsis. Section [XX] states the problem studied here and recalls some results 
from a previous pape r^' 10 about dichotomic measurements with postselection. The 
main result of the present work then follows in section IHH a characterization of all 
triples (T, S,P(-)) allowed in quantum theory, where T is the transition probability 
without measurement, S is the transition probability with measurement, and P(-) is 
the statistics of the intermediate n-outcome projective (resp. generalized) measure- 
ment. After that, section IIVI discusses some particular special cases of this result 
and determines to what extent transition probabilities between quantum states can 
be enhanced by a projective measurement. The latter two sections frequently refer 
back to the mathematical appendix |A] Finally, section |V] presents a brief conclusion. 

Notation. While Dirac notation appears throughout the main text, but not in 
the mathematical appendix. Sometimes we use expressions like {<p\A\ip) also when 
A is not hermitian. In this case, we stipulate that A acts to the right on the vector 
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II. STATEMENT OF THE PROBLEM AND PREVIOUS RESULTS 



Outcome probabilities for ensembles with posts election. As has also been outlined 
in the introduction, we consider a quantum system subject to the following kind of 
procedure: 

(a) prepation of some initial state 

(b) application of a projective (or generalized) measurement with n outcomes, 

(c) postselection 7 with respect to some final state 

We assume that these three consecutive steps happen almost instantaneously, so 
that the dynamics of the system can be neglected. This is not an essential restriction 
since we can always take to be the actual initial state modified by time evolution 
until the time of measurement, and similar for \<p). Also it is no loss of generality to 
take both and \4>) as pure states, since a mixed state can always be purified by 
adding an ancilla to the system with which it is entangled (see e.g . NC0 ° (2.5); for the 
purification of both and we might have to add two ancillas). 

Concerning the intermediate measurement, we will consider the cases of projective 
measurement and of generalized measurement separately. 

We now calculate the outcome probabilties of the intermediate measurement on 
the postselected ensemble. The measurement is taken to be defined in terms of Kraus 
operators 14, k 6 {1, . . . ,n}, with ^2 k V^Vk = 1. It will be assumed for simplicity 
that the measurement is fine-grained, i.e. that to each outcome k corresponds exactly 
one Kraus operator this is enough for our main result IIII.lt an d it should be 
clear how to extend the following considerations to the general case. With these 
assumptions, the probability of getting the outcome k in conjunction with successful 



postselection on the post-measurement state 



is given by the product of 
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the two respective probabilities as 

So the cancellation between the normalization of the post-measurement state and 
the outcome probability gives a surprisingly simple formula for the probability of 
getting the outcome k in the postselected ensemble: 



Here, the normalization factor 

is the success probability of the postselection, i.e. the probability that the final 
measurement of \<ft){4>\ will give a positive result. We may also regard S as the 
measurement-modified transition probability. However in order to have a clearer 
terminology, we will reserve the term "transition probability" for T = |(0|?/;)| 2 , and 
refer to S as the "success probability" . 

Note that the formalism is time-reversal invariant in the sense that the roles of 
the initial state and final state can be interchanged without changing the outcome 
probabilities or the success probability. 

For the case of projective measurements, the Kraus operators should be taken to 
be a complete set of projection operators, 

v k = u k with nj = n fc , n 2 = n fc , ^n fc = i. 

k 

With this replacement we obtain for the outcome probabilities the Aharonov- 
Bergmann-Lebowiz formula (eq. (9) i n AV07 , see also ABL64 ), 

p(k) _ (2) 



with the success probability 

s = j2\(mm 2 

as normalization factor. 

Introducing the problem. The problem to be solved is the following: 

Question II. 1. Given the transition probability T = |(0|?/;)| 2 G [0, 1], which proba- 
bility distributions P(-) on {1, ... , n} are outcome distributions of a projective (resp. 
generalized) measurement for which values of the success probability S G (0, 1]? 

We will only consider the case that the success probability S is strictly positive; 
for vanishing 5*, the postselected ensemble is empty, and hence the probability dis- 
tribution P(-) is not defined. 

The operational significance of question III. II is as follows. All quantities T, 5* and 
P(-) are in principle experimentally measurable. We imagine that some experiment 
has provided us with concrete values for these quantities. Then the task is to find a 
quantum-mechanical model reproducing these particular values, without specifying 
the Hilbert space dimension in advance, and assuming that the measurement is 
projective (resp. generalized). Our main result [TTT7T1 then tells us directly whether 
this is possible or not. Now as already mentioned in the introduction, this could 
be useful for actual high-precision experimental tests of the quantum formalism, 
and help to distinguish e.g. models of dynamical wavefunction collaps o Pea99 from 
orthodox quantum theory, where wavefunction collapse happens instantaneously. 

Previous results. The surprising results o f Fril ° have been a strong motivation for 
the present work. There, the case T = and n = 2 has been treated in section 2, and 
it was found that the only possibility is given by P(l) = P(2) = ±, independently of 
S. This is actually easiest to see on the level of amplitudes, where it follows from 

o = (0|^) = (0|n 1 |^) + (0|n 2 |^), 
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so that the two probabilities for measuring 1 or 2 are given by, respectively, 

K01iW)| 2 _i , |(0in 2 |y>)| 2 _i 

lJ K0|n x |v»)| 2 + |^|n 2 |^>| a 2' u i^n^ + l^in^)! 2 2" 

Intuitively, this means that a dichotomic projective measurement with postselection 
which is orthogonal to the inital state is guaranteed to be a perfectly unbiased random 
number generator. 

III. MAIN RESULTS 

Using the elementary mathematical results listed in appendix |Aj we are now ready 
to answer question III. II in generality. 

Theorem III.l. (a) A given probability distribution P(-) with given T G [0,1] 
and S G (0, 1] can occur via a projective measurement if and only if all the 
inequalities 

j^k k v 

hold. 

(b) With a generalized measurement, any combination of values for P(-), T and S 
can occur. 

Proof. We start with the proof in the projective measurement case. The main idea 
here is to use the completeness relation Ufc = 1 in order to obtain an identity for 
amplitudes 

(<t>m = x>iiw> 

k 

and then translate this into conditions on the probabilities ([2]). To this end, we can 
apply lemma IA.1I to 

z k = (0|IL.|V>), k = l,...,n, z n+1 = -(4>\ip). 
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For then upon setting Xk = a/ P(k)S = |(0|IIfc|"0)| for k = 1, ...,n, and defining 
x n+ i = y/T, it follows that the left-most inequalities of ([3]) are necessary, as well as 
the first inequality of the second formula. 

The remaining inequality follows from two applications of the Cauchy-Schwarz- 
inequality as follows: 



E 1**1 = E < E VW^-VWw) < J2^ U ^-JJ2^ U ^ 

k k k y k y k 

as was to be shown. 

To see that the inequalities fl3]) taken together are also sufficient for the existence 
of a quantum-mechanical model, we again set Xk to be given by the square roots of 
the unnormalized probabilities as Xk = a/ P(k)S for k = 1, . . . , n, and again define 
x n+ i = a/T. Then once more by IA.1| some compatible z^'s with Y^k=l z k — can 
now assumed to be given, and they also satisfy Y^l=i \ z k\ — Sfc=i x n < 1 by the 
assumption ()3]). Now one can use lemma \K?2\ to obtain the states on C™ which are 
given by 

n n 

i^) = E^i^), i0) = E^i fc ) 
k=i ?c=i 

in conjunction with the projection operators 11^ = \k)(k\ for k = l,...,n. Then 



y/P(k)S = \((j)\Ilk\ip)\ and T = |(0|^)| 2 both hold by construction. The requirement 
S = K^lilfcl^)! 2 is automatic by normalization of the probability distribution 
P(-). This ends the proof in the projective measurement case. 

In the generalized measurement case, we will construct |0) and Vk which 
reproduce the given data. We first choose any unit vectors and \<f>) satisfying 
K^l'?/')! 2 = T. Now in Hilbert space of dimension at least n, it is possible to find 
a complete set of mutually orthogonal projectors n fc such that P{k) = (-0 1 IT fc | ■0); if 
one would measure these on the ensemble defined by the initial state without 
postselection, one would obtain the given distribution P(-). Now fix some unit vector 
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with |(0'|0)| 2 = S. Then for those k with P(k) > 0, there exists a unitary Uk 
which maps the unit vector P(k)~ 1 ^ 2 Uf : \^P) to \<p'). We take = U^Ilk, while setting 
Vk = for those k with P(k) = 0. What we have thus constructed is a generalized 
measurement in which the post-measurement state is always | </>'); this guarantees 
that the measurement statistics on the initial state and the postselection are 
probabilistically independent. Hence by construction, the desired statistics P(-), T 
and S have been reproduced. □ 

It is possible to rewrite the inequalities (EJ) in a more convenient form. Since the 
left-most inequality holds for all k if and only if it holds for that k for which P(k) is 
largest, it is enough to require 



2 y JmBxP(k) < ^1 + E V 7 ^) 



In terms of the diversity indices ( 1A4[) . flA5j) 



D1/2 = (j2 V 7 ^)) 



D oo=-— A^77X> D 1/2 =\yWP(k)\ (J) 



maxfc P(fc) 

we can see that the inequalities ([3]) are in fact equivalent to 




(5) 

so that the dependence on the distribution P(-) is only through the dependence on 



the quantities Doo and Di/ 2 . By ( 1A6I) . the allowed interval for y/T/ S is always 
non-empty. 

Remark III. 2. (a) The proof of the theorem shows that it is sufficient to employ 
Hilbert spaces of dimension at most n. For projective measurements, this is 
clearly best possible. For generalized measurements however, the number of 
outcomes is not related to the Hilbert space dimension, and so it might be 
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interesting to study how much the existence of a quantum-mechanical model 
depends on Hilbert space dimension. Since the proof above still involves many 
arbitrary choices, it seems conceivable that one can cover a sizeable part of the 
space of triplets (P(-), T, S) e.g. by qubit models. 

The diversity indices and -D1/2 are simply the exponentials of the min- 
entropy and the Renyi ^-entropy, respectively: 

H 00 = \ogD 00 , H 1/2 = logDi/2. 



The right-most inequality in ((5]) states that 



Intuitively, this means that high randomness in the measurement implies a 
low success probability. So in order to achieve a high success probability, one 
needs to choose a projective measurement with not too much randomness on 
the postselected ensemble. 

A very nice example of how to control transition amplitudes by measurements 
is the Aharonov-Vardi effec t AV8 ° , a variant of the quantum Zeno effect. The 
observation is that any given quantum dynamics \ip(t)) can be approximately 
simulated by starting with the initial state ^(io)) an d conducting projective 
measurements \ip(t n )){ip(t n )\ at the times t n = to+n-St, with n G N. Aharonov 
and Vard i AV8 ° have shown in particular that for St — )■ 0, the probability of ob- 
taining any target state \ip{tf)) at any final time tf approaches unity. Since 
such a sequence of projective measurements can also be seen as a single gener- 
alized measurement, this illustrates part (b) of the theorem. 
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IV. DISCUSSION 



Let us now look at some specific cases of theorem IIH.l( a). So in this section, 
"measurement" always means "projective measurement". 

Case T = with S arbitrary. This is the case that has been studied i n 1 ^ 110 for 
n = 2. As long as we allow the success probability S to be arbitrarily small, all that 
remains are the inequalities 



For n = 2, this reads y/P(l) < y/P(2) and y/P(2) < y/P(l), implying that P(l) = 
P(2) = \. Hence a dichotomic measurement with postselection which is orthogonal 
to the initial state is guaranteed to be a perfectly unbiased random number generator 
(see section HI]) . The n = 3 case is illustrated in figure [2] one obtains a circular 
disk within the probability simplex. This can be shown from ([6]) by squaring the 
inequalities, rearranging, and then squaring again while taking care of the signs. 
This eventually leads to the quadratic inequalities 



for the circular shape of the quantum region in figure [2j Also, just as it should due 
to the result for the n = 2 case, the n = 3 region intersects with any side of the 
triangle in exactly the middle of that side. So whenever the final state is orthogonal 
to the initial state, any intermediate projective measurement with three outcomes 
needs to show statistics lying in this disk. 

For arbitrary n > 2, one can at least say that the *JP(k) always lie in a certain 
subset of W 1 which is the convex cone defined by the inequalities (j6]). Since these n 
inequalities are linearly independent in M n , for dimensional reasons this convex cone 
is a simplex, i.e. the conical hull of n linearly independent extreme rays. One can 




(6) 



(P(l) - P(2) - P(3)) 2 < 2P(2)P(3) + cyclic permutations 
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FIG. 2. The quantum-mechanical region within the probability simplex for three mea- 
surement outcomes, T = (orthogonal postselection), and arbitrary success probability S. 
This is a ternary plot, i.e. each vertex stands for a definite outcome, and each point inside 
the triangle represents a probability distribution over the vertices. 

calculate the mth extreme ray by requiring all inequalities except for the mth one 
to be saturated. Solving the ensuing system of linear equations shows that the mth 
extreme ray y m has the coordinates 

yf = 1 + (2 - n)6 jm 

Hence for any P(-) satisfying one can find non-negative real numbers A m such 
that 

m 

Case S fixed, P(-) fixed. The first two inequalities of fl5]) define an interval of 
possible values for the transition probability T. This can be interpreted as follows: 
by knowing the behavior of the system with measurement, it is possible to predict 
something about how the system would behave without measurement. 

Case T > fixed, P(-) fixed, S arbitrary. Here, it is possible for any P(-) to find 
some appropriately small success probability S such that all inequalities in dSJ) hold 
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FIG. 3. For n = 2 (dichotomic measurement), the possible quantum-mechanical success 
probabilities S as a function of the outcome probability p. High randomness in the mea- 
surement decreases the maximal probability of successful postselection, i.e. the maximal 
measurement-modified transition probability. 



(e.g. S = ^p— ), so no constraints abound. This is one reason why it is important to 
always consider S as an additional parameter. 

Case n = 2 with T and S unspecified. Here, the two probability values -P(l) and 
P(2) determine each other uniquely, so let us write P(l) = p and P(2) = 1 —p. Then 
the inequalities are 

T 



\y/p- Vi^p\ <\l^< Vp+V^p < ^ (7) 

The projection of this into the p-S -plane, where only the last inequality is relevant, 
is shown in figure El For fixed S, some sections of the quantum region are graphed in 
figure HI The first two inequalities of (j7J) define the upper and lower boundary curves 
in these figures, while the third inequality leads to vertical cuts whenever S > ~. 
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FIG. 4. Again n = 2 (dichotomic measurement). These plots show the quantum-mechanical 
region for (p, T) for some values of S. The vertical cuts for S > \ appear due to figure[3l For 
S — > 1, these cuts rapidly approach the p = and p = 1 axes. One possible interpretation 
is that knowing the system behavior with measurement (i.e. S and p) lets us say something 
about system behavior without measurement (i.e. T). 

The T-S-region. How does the transition probability relate in general to the 
probability of successful postselection? To study this, it is best to consider the in- 
equalities in the form ([5]). Figure [5] shows an illustration of the following proposition. 

Proposition IV. 1. For a given number of outcomes n, some success probability S 
can appear in quantum theory together with some transition probability T if and only 

*f 

T T -L. 1 

(8) 



!<s<I±i 

n ~ ~ 2 



Proof. Again it is first shown that these inequalities are necessary. Since Di/ 2 < n, 
the second inequality in ([5]) implies that 

T < nS. 

For proving the second inequality of ([8]), we distinguish two cases. If, firstly, the 
left-most term of ([5]) is non-negative, we can square the left-most inequality of ([5]) 
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and use it as follows: 



T 1 © 4 [d^z 
+ - > — + 2D 1/2 - V " 



S S ~ ' ±/z Do 



£>l/2 



Doc V V 7 



so that the desired result follows from lemma lA.4( a). The second case is that the 
left-most term of ([5]) is negative, which means that Di/ 2 > j^—. If < 2, we have 
therefore D 1/2 > 2, so that S < \ by S < Finally if D M > 2, then (jX6j) also 

1/2 

shows that Du% > 2, giving the same conclusion 5 < ^. In all cases, the second 
inequality of (JHJ) has therefore been verified. 

For checking sufficiency of (jHJ), consider first the case that - < S < T. Then 
by lemma POI b). it follows that the first inequality of ([5]) holds automatically. The 
possible values for Di/ 2 are given by the closed interval [l,n]. Hence it is possible 
to find some value for Dy 2 in this interval which also satisfies (jSJ) whenever ^= > 1, 
which holds trivially, and < y/n, which is true by assumption. This ends the 
proof in this case. 

It remains to prove sufficiency when T < S < Here, it is in fact enough to 

consider probability distributions P(-) supported on two elements, which brings us 
effectively down to the dichotomic case n = 2 from equation (J7J). By < 1, the 
middle inequality of (J7J) is automatic, so one only needs to take care of the remaining 
two. These in turn can be written as 

1-2Vp(1-p)<|, l + 2y/p(l-p) < i 



Upon choosing p such that the second inequality is saturated, one finds that the first 

r+] 
s 



inequality is satisfied as long as > 2. □ 



So this result gives clear bounds on how much a measurement can enhance or 
reduce a transition probability. It has been found that a measurement can reduce a 
transition probability by a factor which is given by the number of outcomes of the 
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measurement. This becomes intuitive when one thinks of the measurement — with 
outcomes discarded — as a decoherence process which can drive the system's state 
towards a totally mixed state or highly mixed state. 

The situation for enhancing transition probabilities by measurement is very dif- 
ferent. We can rewrite the first inequality of flB]) more conveniently in terms of the 
failure probabilities 1 — T and 1 — S, where it reads 



Hence, a measurement can lower the probability that a desired state transition fails 
by a factor of up to 2, but not by more. The proof above has shown that this enhance- 
ment can already be achieved by a two-outcome measurement. This is again intuitive 
in terms of the decoherence due to measurement: for creating a successful transition, 
it would be useless to try to measure a projection operator with support outside of 
the linear span lin{|-?/>), | </>)}. Therefore, a transition-enhancing measurement should 
have non- vanishing probability on exactly two outcomes. 



V. CONCLUSION 

It is a well-known phenomenon that measurements influence transition probabil- 
ities between quantum states. In this article, we have conducted a systematic study 
of this phenomenon and determined how it relates to the outcome distribution of the 
intermediate measurement on the corresponding postselected ensemble. It has been 
found that a given probability distribution for a projective measurement can appear 
in conjunction with a given transition probability and a given success probability of 
the postselection if and only if certain inequalities hold. These inequalities depend 
on the probability distribution only through its min-entropy and its Renyi |-entropy. 
Furthermore, no conditions at all abound if the measurement is allowed to be any 
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FIG. 5. The quantum region of transition probabilities: T is the transition probability 
without measurement, while S is the transition probability with n-ary projective measure- 
ment. All points above the dashed diagonal S = T represent a measurement-enhanced 
transition probability. 

generalized quantum measurement. 

As a consequence of these results, it was possible to bound the enhancement of 
transition probabilities by projective measurements. The maximal enhancement can 
be achieved with two-outcome measurements and is such that the failure probability 
decreases by a factor of 2. 
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Appendix A: Mathematical appendix 

Here we collect various elementary mathematical facts which are referenced from 
the main text. 

It is known (see e.g . Pm05 ) that a finite sequence of non-negative real numbers, 
x\, . . . ,x n is the sequence of edge lengths of a polygon in the Euclidean plane if and 
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only if the inequalities 

%k < ^ gj (Al) 

hold. Since they generalize the triangle inequality, these inequalities are known as 
polygon inequalities; on the other hand, the triangle inequality immediately implies 
that the polygon inequalities are necessary for the existence of a polygon with these 
edge lengths. See e.g . S03 for another occurence of the polygon inequalities in quan- 
tum information theory. 

This geometrical statement directly implies the following: 

Lemma A.l. Given non-negative real numbers x\, . . . , x n , there exist complex num- 
bers zi, . . . , z n with 

\%k\ — X k, Zk = 

k 

if and only if the inequalities 
hold. 

Lemma A. 2. For n > 2 and any z G C n , there exist ip,<p G C n with 

\\ip\\ 2 = 1 = ||0||a, ip k (p k = z k Vfc = l,...,n 

if and only if the inequality 

£}W<1 (A3) 

it 

Proof. Necessity of flA3j) is nothing but the Cauchy-Schwarz inequality: 



E \ Zk \ = e m ■ m < . /E i^i 2 • . /E i^i 2 ^ 1 



20 



That (1A3j) is also sufficient for the existence of such ip and will be shown by 
induction on n. Note that the phases of z k can be changed arbitrarily without 
altering the (non-)existence of such vectors, hence we may as well assume that all 
Zk are non-negative real numbers. We now prove the statement for the initial case 
n = 2. By the assumptions z\, z<i > and Z\ + z 2 < 1 , it is implied that \zi — z 2 \ < 1, 
and therefore it is possible to find angles a and such that 

cos(a + (3) = cos a cos (3 — sin a sin (3 = Z\ — z 2 , 
cos(a — (3) = cos a cos (3 + sin a sin /3 = Z\ + z 2 - 

Hence the two vectors 

(cos a \ / cos/3 

' (t>= \ 

sin a I \ sin j3 

have all the required properties. 

The induction step is a simple rescaling argument. Given z%, . . . , z n+ \ > with 
Z) fc z k < 1, define z[, . . . , z' n as 

z fe = - , k = 1, . . . , n. 

-L — Z n+ i 

(We may assume z n+ \ ^ 1 e.g. by reordering the z^s.) Then by induction assump- 
tion, we can find G C n with H^'lb = 1101b = 1 an d ^101 = ^fc- Now the two 
vectors 

ViV 1 - z n+i for = 1, . . . ,n I 0fcVT - ^n+i for = 1, . . . ,n 

4>k= < 

y/z n+ i fork = n + l I y/z n+ i ior k = n + 1 

do indeed have the desired properties HVIb = ||0|b = 1 and ip k 4>k = Zk, which also 
finishes the induction step. □ 

The following fact can also be regarded as a special case of the Holder inequality, 
but since a direct proof is extremely simple, we have included it here. 
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Lemma A. 3. Let x G M> with Ylk x k = 1- Then, 

2 



V] i/ifc • max x k > 1 



Proof. This is easily shown by a direct calculation: 



• max v^fc 



so that squaring gives the desired result. □ 

Diversity indices. A diversity inde x Jos06 , as used for example in biostatistics, is a 
function that assigns to each probability distribution a real number which is intended 
to measure a sort of effective cardinality contained in the probability distribution. 
In other words, a diversity index is an exponentiated entropy. Like in the main text, 
let P(-) be a probability distribution on {1, . . . , n}. For each q e (0, oo), one obtains 
a diversity index D q by defining 

D g (P)=(^£P(kyY 9 (A4) 

For q = 1, this has to be understood as lim q ^i D q , which is the exponentiated 
Shannon entropy. In a similar way, it is possible to define Dq(P), which is the 
cardinality of the support of P, and D^P), which turns out to be 

D - {p) = ^Wy (A5 » 

The relevant quantities for us are going to be Di/ 2 and D^. When P is the uniform 
distribution on n elements, we have D q (P) = n for all q. 

In this notation, we get a simple reformulation of lemma IA.31 

A,c < D 1/2 . (A6) 
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Lemma A. 4. (a) 

(b) 



2 > 

- 1 



v 7 ^ 

Proof. (a) Let fc G {l,...,n} be such P(/co) is the highest probability in the 
distribution, i.e. P(ko) = maxfcP(/c). Then, 

2 



(b) This is trivial by > 1 and Dy 2 > 1- 

□ 
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